ZymCTRL is a conditional language model for generating artificial functional enzymes. It was trained on more than 37 million sequences from the UniProt database that carry Enzyme Commission (EC) annotations. Given a specific EC number, the model generates protein sequences intended to catalyze the corresponding reaction. The generated sequences are ordered and globular, and they differ substantially from natural sequences while retaining the expected catalytic properties.
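The EC-conditioned workflow described above can be sketched roughly as follows. This is a minimal illustration, not the authors' reference script. It assumes the checkpoint is published on the Hugging Face Hub under the ID `AI4PD/ZymCTRL` (verify before use), that the bare EC number string serves as the prompt, and that the sampling settings shown are reasonable starting points. The helper names `is_valid_ec` and `generate_sequences` are hypothetical.

```python
import re

# A fully specified EC number has four dot-separated integer levels, e.g. "1.1.1.1".
# Assumption: partial ("1.1.1.-") and preliminary ("3.4.24.n1") numbers are rejected here.
EC_PATTERN = re.compile(r"\d+\.\d+\.\d+\.\d+")


def is_valid_ec(ec: str) -> bool:
    """Return True if `ec` looks like a fully specified EC number."""
    return EC_PATTERN.fullmatch(ec.strip()) is not None


def generate_sequences(ec, model_id="AI4PD/ZymCTRL", num_sequences=3, max_length=256):
    """Sample candidate sequences conditioned on an EC number.

    `model_id` and all sampling parameters are assumptions for illustration.
    """
    if not is_valid_ec(ec):
        raise ValueError(f"not a fully specified EC number: {ec!r}")

    # Imported lazily so the validation helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Assumption: the EC number alone is used as the conditioning prompt.
    input_ids = tokenizer.encode(ec.strip(), return_tensors="pt")
    outputs = model.generate(
        input_ids,
        do_sample=True,          # sample instead of greedy decoding
        top_k=9,                 # illustrative value, not a documented default
        repetition_penalty=1.2,  # illustrative value
        max_length=max_length,
        num_return_sequences=num_sequences,
    )
    # Decoded strings still contain the prompt and any special tokens.
    return [tokenizer.decode(out) for out in outputs]
```

Calling `generate_sequences("1.1.1.1")` would download the checkpoint and return raw decoded strings. Any separator or start/end tokens would still need to be stripped, and the candidates filtered (for example by perplexity or predicted structure), before experimental use.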
Scientific Computing
Transformers